Smart Paper Notes

Trajectory Forecasting on Temporal Graphs

Görkay Aydemir , Adil Kaan Akan , Fatma Güney

Categories: Computer Vision | Robotics

2022-07-01

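Predicting the future locations of agents in a scene is an important problem in autonomous driving. In recent years, there has been significant progress in representing the scene and the agents in it. The interactions of agents with the scene and with each other are typically modeled with a graph neural network. However, the graph structure is mostly static and fails to represent temporal changes in highly dynamic scenes. In this work, we propose a temporal graph representation to better capture the dynamics of traffic scenes. We complement the representation with two types of memory modules: one focuses on the agent of interest and the other on the entire scene. This allows us to learn temporally aware representations that achieve good results even with simple regression of multiple futures. When combined with goal-conditioned prediction, we show better results that reach state-of-the-art performance on the Argoverse benchmark.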

StretchBEV: Stretching Future Instance Prediction Spatially and Temporally

Adil Kaan Akan , Fatma Güney

Categories: Computer Vision | Machine Learning

2022-03-25

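In autonomous driving, predicting the future in terms of the location and motion of all agents around the vehicle is a crucial requirement for planning. Recently, a new joint formulation of perception and prediction has emerged, which fuses the rich sensory information perceived by multiple cameras into a compact bird's-eye view representation on which prediction is performed. However, because multiple futures are plausible, the quality of future predictions degrades as the time horizon grows longer. In this work, we address this inherent uncertainty in future prediction with a stochastic temporal model. Our model learns temporal dynamics in a latent space through stochastic residual updates at each time step. By sampling from a learned distribution at each time step, we obtain more accurate future predictions than previous work, especially when stretching both spatially, to more distant regions of the scene, and temporally, over longer time horizons. Despite processing each time step separately, our model remains efficient by decoupling the learning of dynamics from the generation of future predictions.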
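
The stochastic residual update mentioned in the abstract can be illustrated with a toy sketch. This is not the authors' model: the function name `rollout`, the fixed Gaussian with parameters `mean` and `std`, and the `step_size` factor are all assumptions made for illustration, whereas the paper samples from a learned distribution. The sketch only shows the general shape of the idea, in which each latent state is the previous state plus a sampled residual.

```python
import random
from typing import List, Optional, Sequence


def rollout(
    z0: Sequence[float],
    steps: int,
    mean: float,
    std: float,
    step_size: float = 0.1,
    seed: Optional[int] = None,
) -> List[List[float]]:
    """Roll a latent state forward with sampled residual updates.

    Hypothetical stand-in: a fixed Gaussian replaces the learned
    distribution described in the abstract.
    """
    rng = random.Random(seed)
    states = [list(z0)]
    for _ in range(steps):
        prev = states[-1]
        # Residual update: next state = previous state + scaled sampled residual.
        nxt = [z + step_size * rng.gauss(mean, std) for z in prev]
        states.append(nxt)
    return states


if __name__ == "__main__":
    # Two rollouts with different seeds give two different sampled futures.
    print(rollout([0.0, 0.0], steps=3, mean=0.0, std=1.0, seed=1)[-1])
    print(rollout([0.0, 0.0], steps=3, mean=0.0, std=1.0, seed=2)[-1])
```

Sampling a new residual at every step is what lets repeated rollouts from the same starting state diverge into different plausible futures.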

Frustum Fusion: Pseudo-LiDAR and LiDAR Fusion for 3D Detection

Farzin Negahbani , Onur Berk Töre , Fatma Güney , Baris Akgun

Categories: Computer Vision

2021-11-08

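Most autonomous vehicles are equipped with LiDAR sensors and stereo cameras. The former are very accurate but produce sparse data, whereas the latter are dense and rich in texture and color information, but it is difficult to extract robust 3D representations from them. In this paper, we propose a novel data fusion algorithm that combines accurate point clouds with dense but less accurate point clouds obtained from stereo pairs. We develop a framework that integrates this algorithm into various 3D object detection methods. Our framework starts with 2D detections in both RGB images, computes the frustums and their intersection, creates pseudo-LiDAR data from the stereo images, and fills in the parts of the intersection region where LiDAR data is lacking with the dense pseudo-LiDAR points. We train multiple 3D object detection methods and show that our fusion strategy consistently improves detector performance.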

A Late Multi-Modal Fusion Model for Detecting Hybrid Spam E-mail

Zhibo Zhang , Ernesto Damiani , Hussam Al Hamadi , Chan Yeob Yeun , Fatma Taher

Categories: Artificial Intelligence

2022-10-26

In recent years, spammers have tried to obfuscate their intent by introducing hybrid spam e-mails that combine image and text parts, which are more challenging to detect than e-mails containing only text or only images. The motivation behind this research is to design an effective approach for filtering out hybrid spam e-mails, avoiding situations where traditional text-only or image-only filters fail to detect them. To the best of our knowledge, only a few studies have been conducted with the goal of detecting hybrid spam e-mails. Ordinarily, Optical Character Recognition (OCR) technology is used to eliminate the image parts of spam by transforming images into text. However, although OCR scanning is a very successful technique for processing text-and-image hybrid spam, it is not an effective solution for dealing with huge quantities of e-mail, because of the CPU power required and the execution time it takes to scan e-mail files. In addition, OCR techniques are not always reliable in the transformation process. To address these problems, we propose new late multi-modal fusion training frameworks for a text-and-image hybrid spam e-mail filtering system and compare them with the classical early fusion detection frameworks based on the OCR method. A Convolutional Neural Network (CNN) and Continuous Bag of Words were implemented to extract features from the image and text parts of hybrid spam, respectively, and the generated features were fed to a sigmoid layer and to Machine Learning based classifiers, including Random Forest (RF), Decision Tree (DT), Naive Bayes (NB) and Support Vector Machine (SVM), to classify each e-mail as ham or spam.
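
The fusion step described in the abstract, where separately extracted image and text features are combined and passed to a sigmoid layer, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function `fuse_and_classify`, the feature vectors, the weights, the bias, and the 0.5 threshold are all hypothetical, and the CNN and Continuous Bag of Words extractors are assumed to have already produced the feature vectors.

```python
import math
from typing import Sequence, Tuple


def sigmoid(x: float) -> float:
    """Logistic function mapping a real-valued logit to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def fuse_and_classify(
    image_features: Sequence[float],
    text_features: Sequence[float],
    weights: Sequence[float],
    bias: float,
    threshold: float = 0.5,
) -> Tuple[str, float]:
    """Concatenate per-modality features and score them with a sigmoid layer.

    All parameters are hypothetical placeholders; in practice the weights
    and bias would be learned during training.
    """
    fused = list(image_features) + list(text_features)
    if len(fused) != len(weights):
        raise ValueError("weights must match the fused feature length")
    logit = sum(w * f for w, f in zip(weights, fused)) + bias
    score = sigmoid(logit)
    label = "spam" if score >= threshold else "ham"
    return label, score


if __name__ == "__main__":
    # Made-up features: two image features and one text feature.
    print(fuse_and_classify([1.0, 0.0], [2.0], [1.0, 1.0, 1.0], bias=0.0))
```

Because each modality is processed by its own extractor before fusion, no OCR pass over the image part is needed in this kind of setup; the sigmoid layer could equally be swapped for one of the classifiers the abstract lists.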

Explainable Artificial Intelligence to Detect Image Spam Using Convolutional Neural Network

Zhibo Zhang , Ernesto Damiani , Hussam Al Hamadi , Chan Yeob Yeun , Fatma Taher

Categories: Computer Vision

2022-09-07

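Image spam threat detection has remained a popular research area throughout the internet's phenomenal expansion. This research presents an explainable framework for detecting spam images using Convolutional Neural Network (CNN) algorithms and Explainable Artificial Intelligence (XAI) algorithms. In this work, we use a CNN model to classify image spam, while post-hoc XAI methods, including Local Interpretable Model-Agnostic Explanations (LIME) and Shapley Additive Explanations (SHAP), are deployed to explain the decisions that the black-box CNN model makes about spam image detection. We train and then evaluate the proposed approach on a dataset of 6636 images, comprising spam images and normal images collected from three different publicly available e-mail corpora. The experimental results show that the proposed framework achieves satisfactory detection results across different performance metrics, and that the model-independent XAI algorithms can provide explanations for the decisions of different models, which can be used for comparison in future studies.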

Monetisation of and Access to in-Vehicle data and resources: the 5GMETA approach

Djibrilla Amadou Kountche , Fatma Raissi , Mandimby Ranaivo Rakotondravelona , Edoardo Bonetto , Daniele Brevi , Angel Martin , Oihana Otaegui , Gorka Velez

Categories: Computer Vision

2022-08-24

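Today's vehicles are increasingly embedded with computers and sensors that produce huge amounts of data. These data are exploited for internal purposes and, with the development of connected infrastructure and smart cities, vehicles interact with each other as well as with road users, who generate other types of data. Access to these data and to in-vehicle resources, and their monetisation, faces many challenges, which are presented in this paper. Furthermore, the most important commercial solutions are compared with the open and novel approach adopted in the H2020 5GMETA project.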
